# ImageNet-21k Pretrained
| Model | Author | License | Task | Library | Downloads | Likes | Description |
| --- | --- | --- | --- | --- | --- | --- | --- |
| Vit Base Patch16 224.orig In21k | timm | Apache-2.0 | Image Classification | Transformers | 23.07k | 1 | An image classification model based on Vision Transformer, pretrained on ImageNet-21k, suitable for feature extraction and fine-tuning. |
| Vit Large Patch32 224.orig In21k | timm | Apache-2.0 | Image Classification | Transformers | 771 | 0 | An image classification model based on the Vision Transformer (ViT) architecture, pretrained on the ImageNet-21k dataset, suitable for feature extraction and fine-tuning scenarios. |
| Swin Base Patch4 Window7 224 In22k | microsoft | Apache-2.0 | Image Classification | Transformers | 13.30k | 15 | Swin Transformer is a hierarchical window-based vision Transformer model pretrained on the ImageNet-21k dataset, suitable for image classification tasks. |
| Swin Base Patch4 Window12 384 In22k | microsoft | Apache-2.0 | Image Classification | Transformers | 2,431 | 1 | Swin Transformer is a hierarchical vision Transformer based on shifted windows, specifically designed for image classification tasks. |
| Swin Large Patch4 Window12 384 In22k | microsoft | Apache-2.0 | Image Classification | Transformers | 1,063 | 7 | Swin Transformer is a hierarchical window-based vision Transformer model, pretrained on the ImageNet-21k dataset, suitable for image classification tasks. |
| Swin Large Patch4 Window7 224 In22k | microsoft | Apache-2.0 | Image Classification | Transformers | 387 | 2 | Swin Transformer is a hierarchical vision Transformer based on shifted windows, pretrained on the ImageNet-21k dataset, suitable for image classification tasks. |